A Comparative Evaluation of GMM-Free State Tying Methods for ASR

Authors

  • Tamás Grósz
  • Gábor Gosztolya
  • László Tóth
Abstract

Deep neural network (DNN) based speech recognizers have recently replaced Gaussian mixture model (GMM) based systems as the state of the art. While some of the modeling techniques developed for the GMM-based framework can be applied directly to HMM/DNN systems, others may be inappropriate. One such example is the creation of context-dependent tied states, for which an efficient decision tree state tying method exists. The tied states used to train DNNs are usually obtained with this same tying algorithm, even though it is based on the likelihoods of Gaussians and is therefore better suited to HMM/GMMs. Recently, however, several refinements have been published that seek to adapt the state tying algorithm to the HMM/DNN hybrid architecture. Unfortunately, these studies reported results on different (and sometimes very small) datasets, which prevents a direct comparison. Here, we tested four of these methods on the same LVCSR task and compared their performance under the same circumstances. We found that, besides changing the input of the context-dependent state tying algorithm, it is worth adjusting the tying criterion as well. The methods that used a decision criterion designed directly for neural networks consistently and significantly outperformed those that employed the standard Gaussian-based algorithm.
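
To make the "likelihoods of Gaussians" criterion concrete, the following is a minimal sketch of how the standard decision tree tying algorithm is commonly described as scoring a candidate question: the frames at a node are modeled by a single diagonal-covariance Gaussian, and a split is rated by the log-likelihood gain it yields. The function names, the toy data, and the variance floor are assumptions made for illustration. This is not the code of any of the four methods compared in the paper, and the neural-network-oriented criteria are not shown.

```python
import math

def gaussian_log_likelihood(frames):
    """Log-likelihood of `frames` under one diagonal Gaussian fitted to them by ML.

    With ML estimates the value reduces to
    -0.5 * n * (dim * (1 + log(2*pi)) + sum_d log(var_d)).
    """
    n = len(frames)
    dim = len(frames[0])
    log_var_sum = 0.0
    for d in range(dim):
        mean = sum(f[d] for f in frames) / n
        var = sum((f[d] - mean) ** 2 for f in frames) / n
        # Assumed variance floor; the closed form is only approximate when it applies.
        var = max(var, 1e-6)
        log_var_sum += math.log(var)
    return -0.5 * n * (dim * (1.0 + math.log(2.0 * math.pi)) + log_var_sum)

def split_gain(frames, answers):
    """Likelihood gain of splitting `frames` by a yes/no question.

    `answers[i]` is True if the context of frame i answers "yes".
    """
    yes = [f for f, a in zip(frames, answers) if a]
    no = [f for f, a in zip(frames, answers) if not a]
    if not yes or not no:
        return 0.0  # the question does not actually split the node
    return (gaussian_log_likelihood(yes)
            + gaussian_log_likelihood(no)
            - gaussian_log_likelihood(frames))

# Toy one-dimensional "frames" forming two clear clusters (hypothetical data).
frames = [[0.0], [0.1], [5.0], [5.1]]
good_question = [True, True, False, False]   # separates the clusters
poor_question = [True, False, True, False]   # mixes the clusters

print(split_gain(frames, good_question) > split_gain(frames, poor_question))
```

In a full tying procedure, the question with the largest gain would be chosen at each node, and splitting would stop once the best gain falls below a threshold. The criterion depends only on Gaussian statistics of the input features, which is why it fits HMM/GMM systems more naturally than HMM/DNN hybrids.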


Similar resources

Evaluation of voice activity detection by combining multiple features with weight adaptation

For noise-robust automatic speech recognition (ASR), we propose a novel voice activity detection (VAD) method based on a combination of multiple features. The scheme uses a weighted combination of four conventional VAD features: amplitude level, zero crossing rate, spectral information, and Gaussian mixture model (GMM) likelihood. The weights for combination are adaptively updated using minimum...


Robust decision tree state tying for continuous speech recognition

In this paper, methods of improving the robustness and accuracy of acoustic modeling using decision tree based state tying are described. A new two-level segmental clustering approach is devised which combines the decision tree based state tying with agglomerative clustering of rare acoustic phonetic events. In addition, a unified maximum likelihood framework for incorporating both phonetic and...


ISIP 2000 Conversational Speech Evaluation System

In this paper, we describe the ISIP Automatic Speech Recognition system (ISIP-ASR) used for the Hub-5 2000 English evaluations. The system is a public domain cross-word context-dependent HMM based system and has all the functionality normally expected in an LVCSR system, including Baum-Welch training for continuous density HMMs, phonetic decision tree-based state-tying, word graph generation an...


Factor Analyzed Gaussian Speaker Identif

In this paper, the statistical method of Factor Analysis (FA) is applied to a Gaussian Mixture Model (GMM) based speaker identification (SI) system to model the data covariance, which is usually neglected because of training data sparseness. Because the variance of a GMM can represent speaker variability, it is very important in SI systems. By modeling the data covariance with FA, a relative gain of 39.6% o...


Comparative Evaluation of Some Properties of Chicken and Japanese Quail Eggs Subjected to Different Storage Methods

This study investigated the potential effects of egg quality indices at 95% confidence level in order to minimize quality loss during different storage conditions. The chicken and quail eggs’ quality indices including weight, albumen index, yolk index, Haugh index in fresh eggs as well as after storing in moist sawdust, oil, and refrigerator were measured for six weeks. The results revealed tha...




Publication date: 2017